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PROCESS FOR MAKING GENES ENCODING RANDOM POLYMERS OF AMINO ACIDS 


Background of the Invention 
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^ , ,1 trnP-V is a synthetic polypeptide analog of myelin basic protein <MBP). which is a 

ratio of approximately 6.25.1. •J**£°™^ U y of 23i0r jo daltons. Although the resulting polypep- 
S? IT2£3^£ counts, ihey differ with respect to their amino acid 

tjdes are compn rad lor m asse mble a 23.000 dalton polypept.de composed o 

SnTgUmfac^Tys^ tyrosine in im designated ratios. Purification of one or even a small 

-™ - 

Other random sequence amino acid copolymers reiatec to our "«™ " . ' . ricrrvi -V273" 

faToi. abUitv to suppress experimental allergic encephalomyeims (Eur. J. Immunol. [1973] 3.273. 
tested fc'* 6 ™ 11 ^" UPP Biological activity was observed in EAE assays using COP-1 related 
cTof ieloLngcLges occurs: tyrosine is rep.aced by tryptophan; or glutamic acid 

controlled by synthesizing genes of specific lengths. 

Brief Summary of the Invention 
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The subject" invention concerns' a method for synthesizing genes which encode random P°'y™« J* 
amirT achS A farmer aspect of the invention is the identification of certain polypeptides wh.cr, are 
pressed by L sTntheTc Qenes and which have high levels of biological activity. The general method- 

subject invention is the polymerization of smaU 

o.igo^eo^ 

Sf5SSS?E^ colonies conLning COP-1 gene sequences, e.g.. 1000 co.on.es. 
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■n, efficacy of the pool of recombinant COW polypeptides Is tested in e^™^^"** 
u ,!S2L ^ assavs If effective, the pool of colonies exhibiting activity .s further subdrv.ded 
encephaJornyelrts (t*S jy^TJ^^*™ these smaller pools are tested. By sequental.y 

to this approach. wn *h-«r nanes and the polypeptides produced by the 

multiple sclerosis. A preferred copolymer «*>**»Q J^'JJJ ™ or molecular weight can be 
aianine. iysin e, an copo.ymer further includes 
synthesized us.ng the procedures of 1 *e suDjeci in « copo | yme r may consist of alanine, lysine. 


Brief Description of the Drawings 


Figure 1 depicts the general methodology for synthesizing random genes and identifying polypep- 
tides ^f^^ specific strategy for synthesizing genes encoding random-sequence porypep- 

29 tideS ' Figure 3 shows all possible 3-amino acid combinations and their percent occurrence in COP-1. 
Figure 4 shows 9-nucleotide duplexes and adaptors for COP-1 gene synthes.s. 

M dUPleX Ffgure 7 is a Western btot showing the fusion protein produced by four different Cones. 

Fioure S shows variations of random-gene synthesis using small duplexes. „„,„ mar - 
F Sure 9 Sowl synthesis of single-stranded 'DNA encoding random sequence ammo aad PO\yme^ 
figure M rSUK I^phcsphoramidite trinucleotide for random gene synthes.s us.ng a DNA syn- 

35 ^^Rgure 11 shows the DNA and amino acid sequences of rCOFM-77 

treated with myelin basic protein. rCOP-1-19. or rCOP-1-77. 
40 Detailed Description of the Invention 

One strategy for synthesizing genes encoding » 
« The example Illustrates the synthesis of gene > «mpoMd ' <* ^ DN * £f ^ ends - represe nted 

ing the duplexes are synthesized and ar ™ sScSr ends^gn!^d the ends are Joined in an 

X^rei^^^ 

50 SeC.KSSrJS Xr^pXe^es encoded by 9 »ese 9 enes have s,n,a, 
amino acid compositions but different sequences. modifications of the procedures used 

to synthesize a gene encoding a defined amm ^ *?2^"lhVd!p? *es can be joined together In any 
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-r w « orderina Ihe sticky ends at each Junction must be unique. Thus, the process of the subject 
*1S n^at ^dom-seouence genes are synthesized using oligonucleotide dupiexes 
an^no small segments of amino acids, and the sticky ends on each duplex are the same. 

Orproc^duTfor synthesizing genes encoding recombinant COP-1 polypeptides enta,ls usmg 
r duota*. encoding segments of 3 amino acids. All possible permutafcons of the 3 amino 

^£Sr£ compriTedTL four COP-1 amino acids and their percent occurrences in COP-1 are 
^JTrZeTiZiv^mKna* COP-1 genes we have synthesized oHgonucleotides correspond- 
fn to le ^9 and noncoding strands for some of the 3 amino acid segments (Figure 4). The 
oltaonuictioS a?e phosphorylated at the 5 ends. Complementary pairs of oligonucleofdes annealed. 

Twi?. the 3' nucleotide extending on each strand: adenosine on the cod.ng strand, and 
hZSne on Z ^noncoSng strand. Adenosine and thymidine base pair with one another thus ensunng that 
STduSexes TreloTneTdtctionally, that Is. coding strands to other coding strands. Since the dup exes 

contro the ^* * J^,^™ ?? one tf, e ligation products into the vector (Figure 5). The relabve 
^^T^JS^SS^TZ the ra'ndom polymers can also be modulated by mixing the 
SSSi SS- «" ^rent ratios. For example, duplexes ^ ^£#~JJ 
dictated bv the percent occurrence of the corresponding ammo acid segments .n COP i * hl9 "/ ° hofSf . 
£1 number* different duplexes incorporated determines the sequence complexity of the synthetic 
The number of d^erent oup usi ng fewer than 64 duplexes but not all sequence 

^^I^SSrS Nuance comp.exity 9 required will depend on the application of the 
P °' y Sf implication arises In producing completely random COP-1 amino ^^^^^Z 

t0 *T ^££££2 Irfmade C « Coned into a suitable expression vector and transferred to 
'" "a'ST^o, pott***. W «««■*. » »•» »" -»»«Q-» « f" *• ^f,?^ 

colonies grown and lysed on nitrocellulose filters. 


55 Materials and Methods 
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Synthesis and Phosphorylation of Oligonucleotides. The coding and "encoding ^JfgL^JJ 
ea JSrariTy nthesized by the phosphite triester method with an Applied B.osystems Mode IZM I DNA 
ea ^ dUP L?^Vfi' ends of oligonucleotides are phosphorylated on the DNA synthesizer usmg (2-{2-{4.4. 
Sth^xyUvt phosphoramidite from Glen Research. 

Hern D°««i'on of Olioonucleotides. The phosphorylated form of the oligonucleotide is separated from the 

££Sft^c^«*- because hydrocylated oligomer causes the Hgation reactions to 

TTtiTSTSS^ SnTcl^ xtures containing 1 nmo. each of a cooing and corr. 
Annealing and Ugation ot ^ Cl. H 7 6f ^ 0J mM e thylenediaminetetraacetic ac.d 

plementary noncodmg strand 100 ™ M Jns H y. P reactions are allowed to cool 

pmol 01 .sen «H».«»* ■*> 1° JTsiOO units of T4 DNA ligss. (Na« England BloW*. Beverly. 
Ration M« ^ M 7S ™7™ 0 \7* 6 ™° „. pol,.,h yl .ne glycol » "» 

Chemical trials. The agarose plug containing the synthetic DNA is stored at -20 C. 
Preparation of Expression Vector 

assigned the access.on number NRRL B-15904. P R = V ^ wa5 £ £ ,. Dromote r. The diflerenc s 

pRBf 2.2. Uke pBG1. pREV2^ expresses inserted genes behind the E. cou prom 
between pBG1 and pREV22 are the following: 

1. P REV2.2 lacks a functional replication of plasmid <«2!$P"*^ |te ^ sequence insure s 

2 pREV2 2 has the trpA transcription terminator inserted into the Aatll site, mis seq 

"™^^^ iTTp^'K^ » -P»„ and «Ho^«-=* pB6, 
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Reluct plasmid. pBG1 N. where the 2160 base pair Nde. fragment is deleted from pBG1 
was selected by preparing plasmid from ampicillin resistant clones and determining the restneton d.geston 
pa^ems w«h MM and sjl. (product fragments approximately 1790 and 1650). This de.etoon .nactrvates the 

^ wtth EcoRl and Bel. and the larger fragment approximately 

2455 ^f^'^randed fragment was prepared by the procedure of .taKura et al. (Itakura. K„ 
J.J. Rossi a^d R-B. Wallace [1984] Ann. Rev. Biochem. 53323-356. and references therem) wth the 
following structure: 

5' GATCAAGCTTCTOCAGTCGACGCATG 

3' TTCGAAGACGTCAGCTGCGTACGCCT 

AGGCCATGGGCCCTCGAGCTTAA 5' 


CGGATCCGGTACXXX3GGAGCTCG 3' 


This fragment has Bc^l and EcoRl sticlcy ends and contains recognition sequences for several restriction 
endonucleases^ ^ ^ ^ ^ ^ *2El 0 T! 

joined with T4 DNA ligase and competent" cells of strain JM103 were transformed. Cells i hato"9*e 
recombinant plasmid where the synthetic fragment was inserted into pBG1 N between the Bell md l2 RI 
STS^-SS by digestion of the p.asmid with Hpa. and EcoRl. The diagnostic fragment s,res are 
approximately 2355 and 200 base pairs. This plasmid is called pREV1. ^ 

2d. 5 ug of pREV1 were digested with Aatll. which cleaves uniquely. 

2e The following double-stranded fr agment was s ynthesized: 
5 CGGTACCAGCCCGCCTAATGAGCGGGUI I I I I I I IGACGT 3, 
3' TGCAGCCATGGTCGGGCGGATTACTCGCCCGAAAAAAAAC 5 

n,;. fenmsni has Aatll stickv ends and contains the trpA transcription termination sequence. 

^^^J^Sm PBEV1 was ligated-with 0.01 ug of the synthetic fragment in a volume of 

20 U ' U 2 t 9 C T et.s D o? ETJM103. made competent, were transformed and ampicil.in resistant Cones 

S * lBCt °* Using a KpnI . ecor, double restriction digest of plasmid isolated from selected colonies a cell 
containmg the corre^Tco-Sstruction was isolated. The sires of the KpnI. EcoRl S-~^tagn»^ 
approximately 2475 and 80 base pairs. This plasmid is called pREVtTT and contains the tnpA transcnpt.cn 

terminator.^ ^ pREV1TTi prepared ^ dlsc , 0 sed above (by standard methods) was cleaved with Ndel 
and Xmnl and the approximately 850"base pair fragment was Isolated. . no 

^S. 5 ug of plasmid pBR325 (BRL, Gaithersburg. MD). which contains the genes 
resistance to chloramphenicol as well as to ampicillin and tetracycline, was cleaved w,th Bel. andthe ends 
h^rS wi^ Klenow polymerase and dexoynucleotides. After inactivating the enzyme, the mixture was 
Sealed wST Ndel ™dtte approximately 3185 base pair fragment was isolated. This fragment contains the 
genes for chioTirnphenicol and ampicillin resistance and the origin of "P**""- • ■ 

9 3c. 0.1 ug of the Nde.-Xmnl fragment from pREVITT and the Nde.-Bcl I l>gm«n hn JS JmTo? 
ligated in 20 ul with T4~" DNA~figase and the mixture used to transform competent cells of stram JM103. 
Cells resistant to both ampicillin and chloramphenicol were selected. 

Cells ™* a * to > EcoRand Nde , doub , e digest of plasmid from selected Cones, a p asm d ^asse^ted 
giving SgrnenJ sizerol approximately 2480. 1145. and 410 base pairs. This is cailed p.asm,d pREVITT/ch. 
and has genes for resistance to both ampicillin and chloramphenicol. 
4a. The following double-stranded fragment was synthesized: . 
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Mlul EcoRV Qal BamHI 

5' CGAACGCGTGGCCGATATCATCGATGG 
3' GCTTGCGCACCGGCTATAGTAGCTACC 
Sail Hindlll Sinai 
ATCCGTCGACAAGCTTCCCGGGAGCT 3' 
TAGGCAGCTGTTCGAAGGGCCC 5' 


This fragment with a blunt end and an SstJ sticky end! contains recognition sequences for several 
restriction eroyrne -J^ ^ Nm| (wh)cn cleaves ^ 20 nudeotides frorr .the Beg 

16 site) and '^(whtcS craves within the mu.tip.0 Coning site). The larger fragment, approbate* 3990 base 
*** 7! S f i^e^S'fgment from pREVITT/ch. and 0.01 ug of the synthetJc fragment were 

^^S^^S^^ — - " S^M^of Ciai. 

20 ReC om^esT. STSL Sp^ng^ w^^ — 

2S sequelfeng it bTdideoxynucleotide sequencing using standard methods. 

4g. This plasmid is called PREVZ2^ procedures. This plasmid was 

SKSSm"- Agrifu.tS P^ 'SSL^ -n •** 20. 1986 and was assigned^ accession number 
30 NR Rasmfc^pREV 2.1 was constructed using p.asm.d pREV 2.2 and a synthetic o.igonuc.eotide. An 
exa mp,e £^^£^5 SSon enzymes Nju, and BamH, and the 4 Kb fragment Is 
5S0,at 1 fr T^e SKruble-strand oUgonuc.eotide is synthesized: 
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5' CGAACGCGTGGTCCGATATCATCGATG 3' 
3' GCTTGCGCACCAGGCTATAGTAGCTAGCTAG 5' 


3 The fragments from 1 and 2 are Hgated in 20 u. using T4 DMA ligase. transformed into competent 

Nrul si ^BamM, sne and recreating these two restjction , sites. yie.d 
— Cloning the-TJgatjon Products into an ^2^2 SnTS ^Sc genes. The agarose p.ug 
exte^iiol^thaTaTe-EoTrTpatible wrth the adaptors at the termini o ^ e ^ « 37 - a Nine mjcoHters 
containing the size-seiected synthetic genes .s meted « "JoS^S SE~*or (20 ng). The mixture 
of me.ted ^«?^^^i^ 8 ^l^^?S and 1 mM ATP and is diluted to 
is adjusted to contan 66 mM Tris-HCI. pH 75. 5 w iwg^ a. incubated overnight at 16 C. 

sjsss.™ ."sjss- - rii~- - ss— - «— * — . - 

plates containing appropriate drugs. Drocedures Including the best mode, for practicing the 

s<*»nt mixture pmporfcn. «. b, volume unless olherra. now!,. 


45 


50 


7 


30 


35 


40 


EP 0 383 620 A2 


Example 1 - Strategy for Synthesis of Random Sequence Genes 

Model studies using several oligonucleotide duplexes were performed to assess this method for 
^JSSna oenes encoding polypeptides of predetermined amino acid composition but random se- 
5 w«e analyzed with respect to size, ligation junctions. compos.tlon. sequence. 

80,1 £?*2T££* within broad size ranges are produced by varying the ratio of adaptors to 9-mer 
duollxe"s We are able to select genes within more limited size ranges by resolving the ligation products on 
aSose geVs a^d excising gel sOces containing products of a certain long* JJsing these P^e-dum. 0— 
» S?fte following three size .ranges have been isolated: 75M 50. 280-320. and 40(WOO nucleotides. 

IbSumSS.. The synthetic genes have been sequenced to demonstrate that the 9-mer duplexes 
are SSnS and iut insertion or deletion of any nucleotides. Correct juncti one are ^necessary 
for Saining the reading frame and thus producing genes that encode polypeptides of the expected 
amir* aSdcL^on. Sequence ana.ysis revea.ed that the junctions between the duplexes are correct 

' S ^Composition To control the amino add composition of the encoded polypeptides, the synthetic genes 
mu st cTnSn th e duplexes added to the ligation reactions. The results from three gene syntheses 
e^rimenTs demonstrated that the synthesized genes are composed of the duplexes mcluded >n the mput 

ao ^^uencrSeach duplex can ligate to any other, the order of the 9-mer dup.exes In the synthetic 
geneTlhS' be random. Thfc randomness is demonstrated in the Example shown in Rgure 6. The 
fynTetic gene shown In this example Is composed of duplexes encoding the foUowing ammo acd 

T££2£ ^m-urf e^ssion .eve.s. synthetic genes are Coned Into vectors such *at the 
as po ^pSU a re expressed either as fusions with heterologous vector-derived peptide ^uences ^ or as 
Sonfosion polypeptides. The levels of expression of the fusion products can be read.ly measured by 
WeSem btot analysis using antisera directed against the vector-derived portion of the f usson prote.n. Rgure 
7 demonstrates the expression of four COP-1 -containing fusion polypeptides. 
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Example 2 - Variations for e y^"-"1 stance Genes with Small DNA Duplexes 

DNA duplexes of other lengths can be used in an approach similar to that described for duplexes of 9 
nudeotides. Since three nuc.eotides code.for one amino acid, strands of 3. 6. 12 15. or 18 "£Mtee£ 
Z Sealed forming duplexes that code for small blocks of amino acids (see Rgure 8). More sequence 
variation occurs by mixing small rather than large duplexes. 

t add-on. terminal extensions of more than one nudeotide can be emp.oyed. Rgure ,8 shows an 
example using dup.exes of 12 nuc.eotides and extensions of 3 " ucleoW ^^/ e P re ^^ n ^ QS ^° n ^ 
Mdons in the duplexes are varied to produce polypeptides of the desired am.no ac.d compos,t,on. Th.s 
broach reTtricts the polypeptide sequences since the am.no acid encoded by the duplex junctions - 
IZ ine (Ala) hftb exampte - is repeated every fourth amino acid. Also, in another variation of me subject 
^^0^0x^5 of 5* nucleotides can be employed instead of the 3 extensions i.lustrated throughout 
this report • 

Example 3 - Synthesis of Single-Stranded Random Sequence Genes 

Gene synthesis using DNA duplexes results In double stranded genes that are ready for doning into an 
exprSsion vedo' An alternative strategy entails producing single-stranded, random sequence genes The 
SS£n of Ms method to COP-1 is illustrated in Rgure 9. Three-nudeotide oligomers correspond.ng to 
coSons for each of tne amino acids In COP-1 are synthesized, mixed in appropriate ratios, and chemically 
pofymerized in solution to produce long sing.e-stranded COP-1 genes. The "^f^*?? «™£ 
Ts made enzymatically using reverse transcriptase or DNA polymerase. The double-stranded DNA is 
soared for donlna by digestion and repairing the ends of the molecules. 

^SSSSS se « u * nce 9enes ^ a,so be made by ** M i* e po 

a DNA svnthesizer Typically, synthetic DNA is assembled one nucleotide at a time usmg phosphoram.d.te 
l^S^^^i^opmi a strategy for synthesizing single-stranded DNA In three nudeotide 
X^S^**** pho P sphoram.d.te trinucleotides (Rgure 10). The us of 3-nuc.eotide bu.ld.ng 
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trinucleotide on the DNA synthesizer. 

Example 4 ■ Repression of Random Sequence Genes in Vector Producing Fusion Proteins 

specific tor 11» vector-derived portion » dependent on tne presence 01 • uur a 
^cSeTf ? £S IXrod^tsion po.ypeptides comprised o, 34 amino 

rsSS-™*-' 4 pr °t uces t ,eve,s 01 

rCOP-1-19. were subclone* from pREV 2.1 to P^2AN. • f*» ™J ^ Nove mber 20. 

has been deposited as »?*^* Ta iso.ated from the P P.EV 

1984 and given the access,on number of NRRL B i 15910. ine ro » ^ s „ nker 

2.1 recombinant plasmids by f^J^^^^^^nMm of the rCOP-1 gene, 
employed in cloning the rCOP-1 genes and the EooR1 s.te « .n ^ with deox . 

After digestion with the restriction enzymes, the ends of *V«£ 1 9 e J 9S are isolated ^ 

p.asmids encode fusion proteins consisting of f^JS^-S^ cSl^Eo from ft. 5 linker 
Methionine ~S ^J^^ ^ ^fcTSSon pUin.^e nuc,eo«de and 

sequence, in order that the LUK i poiypepuuc i ... ay F , aure5 n and 12. respectively. rCOP-1- 

amino acid sequences for rCOP-1-77 '"^^^S^S,^^,^ YKK» AAE, KAK. EKA. 
19 contains • J» encoding ^ fo „ owjng ^ 

5f 21"^^ -3 ~ -e-na, al.ine residue in each sequence Is .eft 
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hahind following CNBr cleavage of the fusion protein. 

Se invention should not be limited to the examples desenbed above. 

fj2Erz c .^ssinn of Random Sequence Gen s in Nojgusioo Vectors 

Syndetic genes can be Coned Into expression vectors such that the polypeptide products are not fused 
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m v^r-derived protein sequences. As In fusion vectors, these non-fusion vectors contain the WW*" 
I ZSSIw T^slationa] signals; however, the synthetic genes are linked directly adjacent to *e 
^^^^^S^wch m a t mey contain a methionine residue as the amlno-termlnal am.no aad. 
^sSe T*ZStn £2LA 0m COP-1 polypeptides by cyanogen brom.de cleavage. COP-1 
X^^SLlhout this amino-termina. methionine are tested for bio.ogical acfvrty. 
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Example 6 - Purification of rCOP-1 Polypeptides 

-rv '.nation of rCOP-1 polypeptides can be accomplished by a number of methods which are well 
u ^ImoselSed Sfs arf For example. E. coB cells expressing Protein A/rCOP-1 fusion protems are 
known to *° s ^" ed ^JSiby centrifugatioZind lysed using a dynamill. The extract is centnfuged to 
grown ^Xs\e?tf con^n 8 M urea, and chromatographed on an S-Sepharose column usmg a 

rCOP-1 polypeptide is purified by gel filtration and reverse phase HPLC. 


20 Example 7 - EAE Experiments 

rCOp'l has been tested for efficacy in suppressing experimental allergic encepha 
descS above. EAE is a T-cel. mediated ^^^^t SiSS V 

25 (Swanborg. R.H. [ISHH] b? exa _ ote oisease is Induced in Hartley guinea pigs by a angle. 
KTJ-od ,!nS 3 - ^U^ESji *** one or both front limbs, can incite '"continence of 

tal protocols and sconng may be use* have recently bee n tested in the EAE experiments. 
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Effects on rCOP-1 and MBP c 

jnEAE 

Group Duration 2 

Incidence 1 

Day of Onset 2 

Max. ! 

Severity* 

Untreated 

rCOP-1-19 
rCOP-1-77 

Myelin Basic Protein 

7/7 
&B 
7/8 
8/8 

13.5*1.0 (13-15) 
16.8*3.7(13-25) 
17.6 * 2.8 (15-22) 
18.1 ±2.6 (15-20) 

2.3 ± 0.6 (1-3) 

2.8 ± 1.4 (1-4) 

2.9 ± 1.0 (1-4) 
2.3H.1 0-4) 

10.8 * 3.8(4-15) 
8.6 3 * 5.2 (2-15) 
g.9 ± 4.5 (2-15) 
5.3 * 5.5 (2-15) 


1 Incidence = iNumuer wiui — 

2 values are the mean standard deviation. Ranges are in parenth ses. 

3 One animal in this group died. 
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■ m - ,rM^- rTOP-1-77 and rCOP-M9 both delayed the onset of dlseas 
The results can be **«r a *2£T£^ £ 0 P-1 molecules did not aff ct other measures of 
using the treatment regimen described here. The rouKi datay9d t and deC reased 

£ease-incidence. maximum . severity. about this particular 
duration of disease but ^^tZ^T^Z^ animais (2.3) is unusually low. in a pilot 
5 522 rt™^ ^verity scores were Z and 4. it may be possible to op>m*e »e 
effects of rCOP-1 by varying the treatment procedure. 

10 temple 8 ■ Other Ap^n^tions for ^nriom Sequence Polymers of Amino Acids 

Genelencoding random sequence PfP'P" e ~^ 
using the procedures <^ e < ^ can be anticipated, 

amino acid composition and length. * d * bon *J?£^Z™o imposition may be useful additives to 
Random sequence polypepfdes of meTduction of disulfide bonds and the 
hai r care products. One common ,s ™« * ^ duce undes irab.e effects on hair, 

subsequent oxidation of cysteine res.dues to cyste £^ C ^ 9 ^ nair a„ d neutralize these effects. 
Random sequence polypepbdes may be able to w ™ ^sequences will have different physical 

Polypeptides of different These molecules may confer 

properties such as charge, solubil.ty. and abiin> ' to » ^ Qf c<jmbing ^ 

beneficial effects on damaged l hjr such as ■»* pblypepbd.. For example, a 

hair. The effects may vary depending on the J>^=" P * neutralize the negative charge of 

86 lJ ^r1!^i^^ P°-VPeP«^3 ^ng a predeterm.ned amino acid composi- 

tion is as supplements for diets deficient in certain amino ac.ds. 

\ 

so Claims 

- ? is 1 .? atsrsss-. «— - — — 

g. A method, accord.ng to claim 1. w ? ere ' n **^ y acid and either tyrosine or tryptophan, 
substantially of alanine, lysin e f^^^^^X^ ^ 

10. A method, according to. claim 1. wnerein saio v iv r 
5,000 and about 50.000 daltons. synthetic gene codes for a polypeptide which is 

,1. A method, according to B ^^^^ r S2Jautoh«?un. encephalomyelitis. 

TTfEK^ s r encodes for 3 po,ypeptidev * ,ch 13 

eji of prevent, arresting, or controHln g a dernye«n „ multlpl sclerosis. 


35 


40 


45 


50 


55 


11 


10 


75 


20 


2S 


EP 0 383 620 A2 


redetermined amino add constituents, said method comprising the synthesis of genes encoding said 
™2m3vme^ slid synthesis comprising the polymerization of small oBgonudeotide duplexes. said 
SeS for Sg pontes further comprising the Coning of said synthetic genes into an expression 
^JSSS^^or Into a bacteria or eukaryotlc cel. capable of expressing said polypepWe. 
15 A method, according to claim 14. wherein said bacteria is an Escherichia »»• 
6 A method for maWng a fusion polypeptide having one portion compnsed of all °r Part of a 
h« Jrolooous Dolypeptide and a second portion comprised of a random sequence wherem said random 
ZeXorJonTafprUtermined amino add constituents, and said method comprises a"*^-* 
oenes ^ncodiSg said random polymeVs. said synthesis comprising the polymerization of smafl 
9 r It JE2ST duplexes said method further comprising the cloning of said synthetic genes into an 

Smng selector into a bacteria or eukaryotic ceU capable of expressing said fusion P***** 

17 /1 me Id for synthesizing and Identifying a biologically or Immunologically active polypeptide, or 
mixture of polypeptides, said method comprising the following steps: rt( »»-i-*h 
la) synthesizing genes encoding random polypeptides by polymenzabon of small oligonucleotide 

dUPl6X S doning each of said synthetic genes into a vector and transferring said vector into a host capable 

° f eXP ( c rS rJ^under conditions which permit the formation of recombinant co.onies which 

^ •%^£XS£Z a mature of the recombinant polypeptides combined either before or 

after isolation from said colonies, for evidence of biological and/or immunological activity. 

after isolation ^ observe(j ^ ^ mixture of polv p eptides . generating smaller subsets of that 

combination and testing each of these subsets for biological activity: and obta ined 
<f> repeating step (e) until the active component(s) or a suitable mixture thereof is obtamed. 

18. A synthetic gene which codes for rCOP-1-19. 

19. A synthetic gene which codes for rCOP-1-77. 
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Strateav for Synthesizing Random Genes and 
IdentifyinrRecornbinant Polypeptides with Biological Actlv.ty 


figure 1 


mixture o( 

oligonucleotide 

duplexes 


ligale 



NAAAA/ 

NAAAA/ 


random sequence 
synthetic genes 


clone in an expression vector 
transform into hosl 


recombinant colonies, 
each contains one 
synthetic gene 


identification of biologically active polypeptides 


f 

pool colonies (e.g.1000) 

I 

isolate polypeptides 
test lor biological activity 

Iractionate Into smaller pools of 
colonies (e.g. 10 pools 1100 
colonies) 


select individual colonies 
isolate Ihe polypeptide 
test lor biological activity 


EP 0 383 620 A2 


Oligonucleotides 
(9-mers) 


anneal 


Duplexes 


ligate 


Synthetic 
Genes 



expression 


Polypeptides 


Ala-Ala-Ala-Lys-Lys-Lys-Ala-Ala-Ala-Lys-Lys-Lys-Ala-Ala-Ala 
Ala-Ala-Ala-Ala-Ala-Ala-Lys-Lys-Lys-Lys-Lys-Lys-Ly^-Lys-Lys 
Lys-Lys-Lys-Ala-Ala-Ala-Ala-Ala-Ala-Lys-Lys-Lys-Ala-Ala-Ala 
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Figure 3 


All Possible 3 Amino Acid Combinations 
and Their Percent Occurence in Cop 1 


AAA 

7.872 

AEA 

2*624 

ANA 

6 . 560 

AYA 

1,312 

EAA 

2.624 

EEA 

0 ,875 

EKA 

2 . 1137 

t i A 

0 ,437 

KAA 

6 , 560 

KEA 

2. 1U7 

KKA 

5, 466 

KYA . 

1 .073 

YAA 

1 .3 1*2 

YEA 

0 ,437 

YKA 

1 .093 

Y Y A 

0,219 


AAE 

2.624 

AEE 

0 . 875 

AKE 

2. 187 

AYE 

0 . 437 

EAE 

0.875 

EEE 

0 .292 

EKE 

0.729 

EYE 

0.146 

KAE 

2. 187 

KEE 

0.729 

KKE 

1 . 822 

K YE 

0.364 

YAE 

0.437 

YEE 

0.146 

YKE 

0.364 

YYE 

0.073 


AAK 

6 .560 

AEK 

2 . 187 

AKK 

5. 466 

AYK 

1 .093 

EAK 

2. 187 

EEK 

0 .729 

EKK 

1 .822 

EYK 

0 .364 

KAK 

5 . 466 

KEK 

1 . B22 

KKK 

4.555 

KYK 

0.911 

YAK 

1 . 093 

YEK 

0 . 364 

YKK 

0.911 

YYK 

0 . 182 


AAY 

1 .312 

AEY 

0 .437 

AKY 

1 .093 

AYY 

O .219 

EAY \ 

0 . 437 

EE Y 

0. 146 

EKY 

0.364 

EY Y 

0 .073 

KAY 

1 .093 

KEY 

0 .364 

KKY 

0.911 

KYY 

0 . 182 

YAY 

0.219 

YEY 

0 . 073 

YKY 

0 . 182 

YY Y 

0 i 036 


A=Alanine 
E=Glutamic Acid 
K=Lysine 
Y=Tyrosine 


# 
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Figure 4 

9-NucIeotide Duplexes and Adaptors 
for Cop 1 Gene Synthesis 


Examples of 9-Nucleotide Duplexes for Cop 1 Genes: 

c 0din9 a T r T ; G c c G A TCT A 

Noncoding (J.nd)T T T C T T C C B (?) 


Amino acids 


E 


Adaptors: 


GATC * 

For the 5* end ^ 

(BamH I) 


® 

t L__ AGCT 

For the 3' end 

(Sac I) 
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Figure 5 

Construction and Cloning 
Synthetic COP 1 Genes 


5* adaptor 


9-mers 


3 1 adaptor 


GATC- 
(BamH !) 


GATC- 
(BamH \] 


AGCT 


.A. 
-T- 


3> 


ligase 



(Sac I) 


.A. 


-A 
-T 


•AGCT, 


-T- 


(Sac 1) 


size-select genes 


GATC- 


clone into vector 


/AGCT Synthetic Gene 


CTAG TCGA 


Plasmid Vector 
cut with BamH 1 
and Sac I 
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Figure 6 

Sequence Analysis of a 
Synthetic Gene 


amino acids: Y K KEA E K ■ A K 
nucle otides: T A C A A G A A a£ A A G C A G A a|* A Q G C T A A A j 

JV KKY KKK KA 
amino acids: Y * ^ i 
nuclides: T AC A AG A A a|t AC A AG A A A^A3 A AG G C A| 


amino acids: E 
nucleotides 


K K A E A E 


' : GA agcagaa|a.agaaggca(3aagcagaa1 

urKAEAEE V AE 
amino acids: K K a 

nuclides: AAGAAGGC a|g AAGCAGAApAAGCAGAAj 

amino acids: K K A K A K E A E 
nucleotides: AAGAAGGC a|a AGGCTAAAjjAAGCAGAA | 

K KAKK AKAK 
A A G A A G G C a|a A G A A G G C a|a AGGCTAAAj 


amino acids: K 
nucleotides 


The lines in the nucleotide sequence mark the Junctions 
between 9-mer duplexes. 


Amino Acids: 
A= alanine 
Kslysine 
E=gluiamic acid 
Y=iyrosinc 
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Figure 7 
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Figure 8 

Variations of Random-Gene Synthesis 
using Small Duplexes 


One Nucleotide Extension: 


GCA GAA 
3-mers JCG TCT 


GCAAAA T GAAGCA 
6-mers JCGTTT TCTTCG 


GCAG AAGCAA AA 
12-mers TCGTCTTCGTTT 


Three Nucleotide Extensions. 12-mer Duplexes: 

duplex XXXXXXXXXGCA 
H CGTXXXXXXXXX 


1 


ligate 


Gene xxxjbcxxxxGCAXXxxxxxxxGCAJoaaooooaccA 

9 C GTXXXXXXXXXCGTXXXXXXXXXCGTXXXXXXXXX 


polypeptide 


Ala aa aa aa Ala aa aa aa Ala aa aa aaa Ala 


xxx» any codon 

aa~ amino acid specified 

by cod nxxx 
Ala» alanine 


* 
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Figure 9 

Synthesis of Single-Stranded DNA Encoding 
Random Sequence Amino Acid Polymers 


3 Nucleotide Segments for COP 1 

Ala 
*GCA 3 ' 
ratio: 6 


Glu 
5 GAA 3 ' 


Lys 
5 AAA* 
5 


3' 


Tyr 

5 TACT 
1 


Polymerize 

A. Solution Chemistry 
or B. DNA Synthesizer 


Single-Stranded DNA 5 f 


5' 


Double-Stranded DNA gf 


—3' 


Enzymatic Synthesis 
of the Second Strand 


.3" 
.5' 


Clone into 
Expression Vector 
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Figure 10 


DMTr 
I 

O 
I 

CH 


Phosphoramtdlte Trinucleotide for Random 
Gene Synthesis Using a DNA Synthesizer 


H 2 ?1 


o=p— O 


Phosphoramidite 
Trinucleotide 


? 

ch 2 

? „ 

o=p— O 



CN-CH2-CH2O X N-CH(CH3)2 
CH(CH3)2 


Phosphoramidite 
Mononucleotide 


DMTr 

? 

CH 2 


P 

CH30 |vl— CH(CH3)2 
CH (CH3)2 
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FIGURE 11 
rCOP-1-77 



CGTATC-TTCTTTCTTCGTCTTA' 
. Y K K E- A E Y K K Y - 

CAG AA T A^^ 

m KKKA KEAEKAK KAKYI^K 

A ^ Y m G AAG CTG CTAAAGCTG CTAAAG CTGCTG CAG CTG CTG C AT AC AAG 
TAC^GAAAGAAGCAG^GCTGCTAAAe ^ ^"^GTATGTTC 

AAAGAAGCAGAAGCTGCTGCA^ +-- r , TTTCTT 

. , . T K * * * * « * * ****** * 

-"TrZZZZZrZg^ccTCTTCGTCTTTTCCGATTTCTTCGTCTTATG 


AAAGAAG C AG AAAAGG C 
TTTCTTCGTCTTTTCCGATTTCGAO 


KAKAAAEAEKAKE 

LLIcLg^g^g^g^ggct^g^cag^ 

KKYKKSAEKAKEXS.* 
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FIGURE 12 
rCOP-1-1 9 


GCA^GGCTGCAGAG^GCAAA J— ~^^^TCTCTTTCGTTTC 

+ ;i;; r ^ TCG TTTCCGACGTTTCTTCCGT«TGC-ri^ 

CGTTTCCGACGTCTCx.TCGIH _ ^ ^ „ 


A A K K 


A Y E A E * 



IcTTCGTTTCTTCCGTCTCTTT 


CGXTTTATGCTTCCXx. kaeKaE KA*A* ' 

A K Y E A K . _ - c > gAGAAAGCAAAGGAA 

KKAKEAK^a AGAC -AAAGCAAAG 

gcagaga^gca^^^^ 



KXKA AYKK 


TTCCGTCTCTTTCGTTTCCGACGTCTCTTTCGTTTC ^ ^ ^ ^ 

KAE KAKAAE 
GCGAAAGCAAAGGCTGCATAA 
• CGCTTTCGTTTCCGACGTATT 



